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ABSTRACT 

In  this  paper  we  present  a  method  for  clustering  and  classification  of  acoustic  color  data  based  on  statistical 
analysis  of  functions  using  square-root  velocity  functions  (SVRF).  The  convenience  of  the  SVRF  is  that  it  trans¬ 
forms  the  Fisher-Rao  metric  into  the  standard  L2  metric.  As  a  result,  a  formal  distance  can  be  calculated  using 
geodesic  paths.  Moreover,  this  method  allows  optimal  deformations  between  acoustic  color  data  to  be  computed 
for  any  two  targets  allowing  for  robustness  to  measurement  error.  Using  the  SVRF  formulation  statistical  models 
can  then  be  constructed  using  principal  component  analysis  to  model  the  functional  variation  of  acoustic  color 
data.  Empirical  results  demonstrate  the  utility  of  functional  data  analysis  for  improving  performance  results  in 
pattern  recognition  using  acoustic  color  data. 
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1.  INTRODUCTION 

The  problem  of  underwater  object  detection  and  classification  capture  using  sonar  has  attracted  a  substantial 
amount  of  attention. 1-5  This  problem  is  complicated  due  to  various  factors  such  as  variations  in  operating  and 
environmental  conditions,  presence  of  spatially  varying  clutter,  variations  in  target  shapes,  compositions  and 
orientation.  Moreover,  bottom  features  such  as  coral  reefs,  sand  formations,  and  vegetation  may  totally  obscure 
a  target  or  confuse  the  detection  process.  Consequently,  a  robust  detection  system  should  be  able  to  quantify 
changes  between  the  returns  from  the  bottom  and  any  target  activity  in  sonar  images,  while  at  the  same  time 
extract  useful  features  for  subsequent  classification.  Thus,  a  robust  system  designed  to  mitigate  false  alarms  in 
various  clutter  density  scenarios  will  be  desirable. 

Considerable  research  has  been  devoted  to  the  development  of  different  detector  and  classification  method¬ 
ologies  to  detect  and  classify  underwater  objects  utilizing  sonar  imagery.  Dobeck1,6  utilized  a  nonlinear  matched 
filter  to  identify  mine-size  regions  that  match  the  target  template  in  a  side-scan  sonar  image.  For  each  detected 
region,  several  features  were  extracted  based  on  the  size,  shape,  and  strength  of  the  target  template.  A  stepwise 
feature  selection  process  was  then  used  to  determine  the  subset  of  features  that  maximizes  the  probability  of 
detection  and  classification.  A  k-nearest  neighbor  and  an  optimal  discrimination  filter  classifier  were  used  to  clas¬ 
sify  each  feature  vector  and  the  decisions  of  the  two  classifiers  were  fused  to  generate  the  final  decision.  In,2  the 
adaptive  clutter  filter  detector  in'  was  individually  applied  to  three  different  sonar  images  varying  in  frequency 
and  bandwidth.  Final  classification  is  done  using  an  optimal  set  of  features  using  a  nonlinear  log-likelihood 
ratio  test  where  the  decisions  of  the  individual  detector  and  classifier  are  fused.  Recently  in4  we  developed  a 
new  coherence-based  detection  framework  for  dual-sensor  problem  using  Canonical  Coordinate  Analysis  (CCA) 
that  can  be  applied  to  the  data  collected  using  two  disparate  sonar  systems.  Using  this  method  allows  for  the 
simultaneous  detection  and  feature  extraction  of  coherent  target  information  among  two  sonar  images. 

These  methods  all  use  traditional  synthetic  aperture  sonar  (SAS)  images  which  often  provides  high  quality 
images  of  proud  targets  which  are  useful  for  image  based  detection,  localization,  and  identification  algorithms; 
however,  this  is  not  the  case  for  buried  targets  where  images  are  usually  blurred  with  less  structure  definition, 
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and  hence,  target  identification  from  these  images  is  more  difficult.  Generating  acoustic  color  data  products  is 
one  way  to  overcome  these  shortcomings.  Acoustic  color8,9  is  a  simple,  spectral-based  method  that  generates 
a  normalized  plot  showing  the  strengths  of  the  return  signatures  off  an  object  in  individual  frequency  bands 
at  various  aspects  that  may  provide  features  useful  for  identification.  The  problem  is  then  how  does  one  do 
statistical  analysis  on  these  acoustic  color  images,  which  are  essentially  a  two-dimensional  function  of  frequency 
and  aspect  which  represents  target  strength. 

The  solution  to  this  problem  is  statistical  functional  data  analysis.  However,  one  must  be  aware  of  a  problem 
that  manifests  itself  when  doing  classification  and  pattern  recognition  problems  on  this  type  of  data.  The  data 
can  be  sensitive  to  sensor  placement  and  alignment  between  samples  over  a  target.  In  some  instances,  it  may  be 
relatively  easy  to  decide  how  to  warp  functions  from  different  samples  for  proper  alignment,  however  this  can 
become  quite  cumbersome  for  large  data-sets  and  tends  to  be  done  in  a  supervised  fashion.  In  this  paper,  we 
present  a  framework  for  unified  warping  and  comparison  of  function  data  using  recent  developments  from  shape 
theory  in  statistics  and  use  it  for  the  alignment  and  classification  of  acoustic  color  data.  Using  a  differential 
geometric  approach,  our  method  unlike  previous  methods  in  functional  data  analysis,  allows  warping  and  analysis 
in  a  single  step.  In  other  words  we  are  able  to  warp  the  functions,  calculate  metrics,  and  naturally  decompose  the 
x  and  y  variability  of  the  data  in  a  efficient  unsupervised  manner.  Moreover,  under  this  framework  a  mapping 
called  a  square-root  velocity  function  (SVRF)  provides  simplification  of  the  metric  where  the  standard  Fisher- 
Rao  metric  becomes  the  standard  L2  inner  product  and  the  standard  cross-section  statistical  analysis  becomes 
applicable.  In  this  way,  one  calculate  the  mean  of  the  SRVFs  while  optimizing  over  the  warping  of  the  mean 
function. 

This  paper  is  organized  as  follows:  Section  2  presents  and  reviews  the  current  theory  in  elastic  function 
alignment  and  presents  the  Fisher  Rao  Metric  and  the  method  for  warping  the  data.  Section  3  presents  the 
results  of  this  method  when  applied  to  acoustic  color  data  presenting  both  the  warping  and  separability  of  the 
metric  in  feature  space.  Finally,  conclusions  and  observations  are  offered  in  Section  4. 

2.  FUNCTION  REPRESENTATION  AND  ALIGNMENT 

This  work  is  motivated  by  recent  developments  in  statistical  shape  analysis  of  parameterized  curves10-12  in  which 
we  take  a  differential  geometric  approach  to  statistical  functional  analysis  of  elastic  functions.13  Consider  any 
two  functions  /i,/2  £  T,  where  T  is  an  appropriate  space  of  functions  on  [0,1],  and  7  £  T.  The  main  problem 
solved  in13  is  can  we  find  a  distance  on  T  such  that  d(/i,/2)  =  d{f\  o  7,  /2  07),  for  all  /1 ,  /2 ,  and  7. 

The  solution  comes  from  a  generalization  of  the  square-root  representation,  which  under  this  transformation 
the  Fisher-Rao  Riemannian  metric  becomes  the  L2  inner  product  and  the  standard  cross-sectional  functional 
analysis  becomes  applicable.  Moreover,  by  computing  the  mean  of  the  SRVFs  we  can  provide  an  alignment  of 
the  individual  functions  to  the  mean  and  a  covariance  operator  that  can  be  further  used  for  functional  principal 
component  analysis  (FPCA) .  The  important  aspect  of  this  method  is  that  all  the  analysis  of  computing  statistics, 
modeling,  principal  components  is  performed  in  the  L2  space  of  SRVFs  and  the  final  results  are  converted  back 
to  the  original  function  space.  The  reasons  for  this  is  that  the  L2  metric  is  a  proper  distance  for  analyzing  SRVFs 
of  elastic  functions  and  allows  for  a  formal  statistical  analysis.  The  method  will  be  briefly  described  below,  for 
a  more  in-depth  discussion  see.13 

2.1  Pairwise  Alignment  Problem 

Let  r  be  the  group  of  all  warping  functions 

T  =  {7  :  [0, 1]  — »  [0, 1] |7(0)  =  0, 7(1)  =  1, 7  is  a  diffeomorpliism}  (1) 

and  it  acts  on  the  function  space  by  composition  (/,  y)  =  f  07.  It  is  common  to  use  the  the  following  function 
for  alignment 

min||/i  07- /2||  (2) 

7EI 

which  however  is  not  a  distance  function  and  is  not  symmetric  which  inhibits  the  definition  of  any  statistical 
measures.  It  can  however  be  made  symmetric  by  using  double  optimization,  but  still  fails  in  being  a  distance 
function  to  due  lack  of  isometry.  The  solution  to  this  problem  is  the  Fisher-Rao  distance. 


Figure  1.  Ensemble  alignment  problem. 


2.2  Fisher  Rao  Metric 

Let  /  represent  a  real- valued  continuous  function  on  the  interval  [0,1]  and  let  T  denote  the  set  of  all  such 
functions.  For  any  v\,v2  £  Tf(F),  the  Fisher- Rao  Riemannian  Metric  is  defined  as 


1  f1 

({vi,v2  ))f  =  -  vi  (t)v2(t)-j 

4  Jo  \j 


\m\ 


dt 


(3) 


which  is  invariant  to  re-parameterization,  ((v\,v2)) /  =  (((vioy),  (f2°7))) /o7  and  leads  to  the  Fisher-Rao  distance 


dFR{fuh)=  inf 

a:[0,l]->Jr,a(0)=/i,a(l)=/2 


(((d(T),d(r)))Q(T))1/2dr)  . 


(4) 


This  distance  satisfies,  c£f.r(/i>  f2)  =  dFFt(fi  °7,  f2°j)  and  is  invariant  to  domain  warping,  however  it  is  difficult 
to  compute  and  only  a  few  numerical  solutions  attempt  to  solve  it. 

We  can  overcome  this  problem  by  defining  a  mapping 


Q(:r)  =  (X/V 'N  ifN^° 

[  0  otherwise. 


(5) 


For  studying  /,  we  will  represent  it  using  the  SRVF  defined  as  q  :  [0, 1]  — >  R,  where  q{t)  =  Q(f(t))  =  |/(t)| 

and  if  /  is  continuous,  then  q  is  square- integrable.  The  SVRF  can  then  be  reparameterized  by  (q,  7)  =  (qo  7)1/7 
and  the  space  of  all  SVRFs  is  L2([0, 1],  R)  or  L2.  Therefore,  under  the  SVRF  representation  of  /,  the  Fisher-Rao 
Riemannian  metric  becomes  the  L2  distance,  i.e. ,  dpR  =  ||9i  —  q2 1|.  The  proof  of  which  can  be  found  in.13 
Moreover,  for  any  two  SVRFs  q\,q2  £  L2  and  7  G  F,  we  have  11(91,7)  —  (92,7)!]  =  ||9i  —  92 1|- 

2.3  Ensemble  Alignment 

Now  that  the  Fisher-Rao  distance  on  J-  has  been  defined,  we  can  now  define  the  warping  framework  to  align  the 
functions  horizontally  to  improve  the  matching  of  features  (peaks  and  valleys)  across  functions.  In  the  SVRF 
framework  this  is  accomplished  naturally  using  Karcher  mean.  If  we  define  S  to  be  the  quotient  space  S  =  L2/T, 
then  S  is  the  set  of  all  orbits  of  an  SVRF;  [9]  =  closure{(9, 7)|y  G  T}.  We  can  then  align  the  functions  by 
computing  a  mean  function  in  S  and  during  that  process  we  warp  the  SRVFs  of  the  given  functions  to  match 
the  mean  function.  A  graphical  diagram  of  this  process  is  given  in  Figure  1. 

Lets  define  a  distance  function  on  S 

d0{[qi],  [92])  =  min  ||9!  —  (92  °7)V7'II  (6) 


which  is  a  proper  distance  function  and  the  Karcher  mean  can  be  computed  using  this  distance  function  in 


H  =  argmin  d0([g],  [g*])2.  (7) 

qe  l2 

The  above  optimization  problem  can  be  solved  by  the  following  algorithm  which  relies  on  dynamic  programming. 

Algorithm  1:  Karcher  Mean  of  SRVFs 

1.  Initialization:  Select  p,  =  q. i  for  some  i  €  {1,  2, . . . ,  n} 

2.  For  each  g^,  i  =  [1,  2, . . . ,  n]  compute  the  optimal  warping  function 

7*  =  arg  min|  |/r  -  (qt  o  7)  -//I  I  (8) 

7GT 

which  can  be  solved  using  dynamic  programming. 

3.  Compute  the  aligned  SRVFs  qi  =  ( qi  o  7*)y^f 

4.  If  the  increment,  ||^  q^  —  /r||  is  small,  then  continue.  Else  update  the  mean  y,  1— >  ^  ft  and  return 
to  step  2. 


This  algorithm  results  in  the  Karcher  mean  y,  the  aligned  SRVFs  {gi},  and  the  optimal  warping  functions 
{7*}.  We  can  then  convert  the  aligned  SRVFs  to  aligned  functions  using 

ft  ^  fi{t)  =  /i(0)  +  [  gJ(s)|gi(s)|  ds  (9) 

J  0 

and  we  now  have  the  desired  components:  the  y  variability  {/,}  and  the  x  variability  {7*}.  In  the  SRVFs  we 
can  then  calculate  proper  distances,  metrics,  perform  classification,  and  perform  function  principal  component 
analysis  and  impose  probability  models  on  the  coefficients  and  then  bring  the  results  back  to  the  function  space 
using  integration. 


3.  EXPERIMENTAL  RESULTS 

In  this  section,  we  describe  some  experimental  results  to  demonstrate  the  use  of  this  elastic  functional  analysis  on 
acoustic  color  data.  First,  we  present  results  of  the  warping  of  the  acoustic  color  data  and  how  it  improves  feature 
alignment.  Second,  results  showing  the  separability  of  the  warped  data  over  the  original  data  and  classification 
rates  between  the  two  using  the  L2  metric  defined  in  Section  2  and  calculated  using  SRVFs. 

3.1  Data  Description 

The  data  used  in  these  experiments  was  collected  at  the  NSWC  PCD  test  pond.  For  a  description  of  the  pond 
and  experimental  setup  the  reader  is  referred  to.14  The  raw  sonar  data  was  collected  using  a  1  -  30 kHz  LFM 
chirp  and  data  was  collected  for  a  11  proud  targets  that  included  a  solid  aluminum  cylinder,  an  aluminum 
pipe,  an  inert  81  mm  mortar  (filled  with  cement),  a  solid  steel  artillery  shell,  two  machined  aluminum  UXOs,  a 
machined  steel  UXO,  a  de-militarized  152mm  TP-T  round,  a  de-militarized  155mm  empty  projectile  (without 
fuse  or  lifting  eye),  a  small  aluminum  cylinder  with  a  notch,  and  two  rocks  with  sizes  comparable  to  the  UXO 
targets.  The  aluminum  cylinder  is  2 ft  long  with  a  1  ft  diameter;  while  the  pipe  is  2 ft  long  with  an  inner  diameter 
of  1  ft  and  3/8  inch  wall  thickness. 

The  acoustic  color  data  was  generated  from  the  raw  sonar  data  to  construct  target  strength  as  a  function 
of  frequency  and  aspect  angle.  Due  to  the  relatively  small  separation  distances  between  the  targets  in  the 
experimental  setup,  the  scattered  Helds  from  the  targets  overlap.  To  generate  the  acoustic  templates,  SAS 
images  were  formed  and  then  an  inverse  imaging  technique  was  used  to  isolate  the  response  of  an  individual 
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(a)  Aluminum  Pipe  Pass 


0  5  10  15  20  25  30 


20 

■40 

60 

80 

■20 

40 

60 

■80 


Freq  (kHz) 

(b)  Aluminum  Pipe  Pass  #2 

Figure  2.  Example  acoustic  color  map  for  the  aluminum  pipe  for  two  different  target  passes. 
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(a)  Original  Data  (/) 
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(c)  Warping  Function  (7*) 

Figure  3.  Original  data  and  aligned  functions  for  aluminum  pipe  at  0°  aspect. 

target  and  to  suppress  reverberation  noise.  A  brief  summary  of  this  process  is  as  follows:  The  raw  sonar  data 
is  matched  filtered  and  the  SAS  image  is  formed  using  the  u>  —  k  beamformer.15  The  target  is  then  located  in 
the  SAS  image  and  is  then  windowed  about  selected  location.  This  windowed  image  contains  the  information  to 
reconstruct  the  time  signals  associated  with  a  given  target  via  inverting  the  ui  —  k  beamformer.16 

From  the  11  targets,  9  different  data  collections  runs  were  done,  and  99  acoustic  color  templates  were  generated 
using  the  method  described  above,  see  Figure  2  for  an  example  acoustic  color  map  for  the  aluminum  pipe  for  two 
different  data  collection  runs.  From  the  acoustic  color  maps,  one-dimensional  functional  data  was  generated  by 
taking  slices  at  aspect  values  of  — 10°,  0°,  10°  and  therefore  generating  297  data  samples.  The  two  figures  look 
similar,  however  there  are  discrepancies  between  the  two  maps  notably  at  larger  aspect  angles  and  the  magnitude 
of  the  function.  These  discrepancies  and  misalignment  can  greatly  inhibit  classification  performance. 

3.2  Data  Warping 

The  first  results  we  present  are  the  warping  results  on  an  example  object  from  the  above  described  data  set. 
Figure  3a  presents  the  original  target  strength  acoustic  curves  for  the  aluminum  pipe  at  0°  aspect.  These 
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Figure  4.  Original  data  and  aligned  functions  statistics  at  0°  aspect. 


represent  27  different  target  passes  over  the  aluminum  pipe  and  are  one  slice  from  the  acoustic  color  map.  The 
frequency  dependent  target  strength  functions  are  somewhat  similar,  but  we  can  see  variations  in  strength  and 
small  shifts  in  location  of  the  features  of  the  function  (i.e. ,  peaks  and  valleys). 

Figure  3b  presents  the  resulting  aligned  function  {_/)}  using  Algorithm  1  presented  in  Section  2.3  and  Figure  3c 
presents  the  corresponding  warping  functions  {7*}.  The  plots  of  {/)}  show  tighter  alignment  of  the  functions 
with  more  defined  peaks  and  valleys,  representing  more  tighter  alignment  of  the  measured  data  of  the  same 
physical  phenomena.  Figure  4a  and  4b  present  the  cross-sectional  mean  ±  standard  deviation  of  the  original  and 
warped  data,  respectively.  The  difference  between  the  original  and  warped  data  can  be  noted  by  the  spread  of 
the  standard  deviation  from  the  mean  and  for  the  warped  data  it  is  much  tighter  with  the  peaks  more  defined 
around  certain  frequencies. 

3.3  Object  Clustering  and  Classification 

Next,  we  present  the  results  of  object  clustering  and  classification  of  the  11  objects  in  the  acoustic  color  data 
set.  The  first  step  in  processing  was  each  object  was  aligned  in  class  using  Algorithm  1.  After  the  aligned  data 
and  warping  functions  were  calculated  the  distance  between  each  of  the  functions  in  the  data  set  was  calculated 
using  do  and  was  done  in  SVRF  space  where  the  distance  is  easily  calculated  using  the  L2  metric.  Figure  5a  and 
5b  presents  the  symmetric  distance  matrix  containing  the  pairwise  distances  for  the  original  data  {g^}  and  the 
warped  data  {gf},  respectively.  It  can  easily  be  seen  from  the  matrices  there  is  a  definite  separation  in  Figure  5b 
over  Figure  5a  where  the  distances  in  class  have  been  decreased  and  distances  out  of  class  have  been  increased. 
There  still  remains  a  few  difficult  separation  points  for  the  aluminum  pipe  and  solid  aluminum  cylinder. 

The  distance  matrices  in  Figure  5  are  clustered  using  their  3-D  embeddings  for  two  different  manifold  learning 
techniques,  multidimensional  scaling  (MDS)  and  Diffusion  Maps  to  demonstrate  the  discriminating  ability  of 
these  two  metrics.  Figure  6  shows  the  results  for  the  original  and  warped  data  using  both  MDS  and  diffusion 
maps.  For  the  original  data,  we  can  see  that  MDS  was  able  to  separate  only  one  of  the  11  objects  and  there  is  no 
separation  for  diffusion  map.  However  after  alignment  using  developed  algorithm,  both  methods  have  excellent 
clustering  results  for  both  embedding  techniques. 

Figure  7a  and  7b  presents  the  precision  versus  recall  plots  for  the  original  and  warped  data,  respectively. 
Precision  is  defined  to  be  the  true  positive  rate  divided  by  the  sum  of  the  true  positive  rate  and  false  positive 
rate  and  where  recall  is  defined  to  be  true  positive  rate  divided  by  the  sum  of  the  true  positive  rate  and  the 
false  negative  rate.  In  a  classification  task,  a  precision  score  of  1.0  for  a  class  C  means  that  every  item  labeled 
as  belonging  to  class  C  does  indeed  belong  to  class  C  (but  says  nothing  about  the  number  of  items  from  class 


250 


250 


200 


150 


100 


50 


value 

0 
2 
4 
6 
8 
10 
12 


i  i  i  r 

100  150  200  250  50  100  150  200  250 

(a)  Original  Data  ( q )  (b)  Warped  Data  ( q ) 

Figure  5.  Pairwise  distance  matrices  calculated  using  do- 
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Figure  6.  Target  clusters  for  acoustic  color  metric  using  do. 
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Figure  7.  Precision  versus  recall  plot  for  pairwise  distance  calculated  using  do. 


C  that  were  not  labeled  correctly)  whereas  a  recall  of  1.0  means  that  every  item  from  class  C  was  labeled  as 
belonging  to  class  C  (but  says  nothing  about  how  many  other  items  were  incorrectly  also  labeled  as  belonging 
to  class  C). 

By  examining  Figure  7a  the  precision  recall  plot  drops  off  very  rapidly  for  a  low  recall  demonstrating  a  very 
lackluster  classification  performance  of  the  original  data  using  the  distance  metric.  However,  after  warping  and 
alignment  and  using  the  geodesic  distance  the  performance  is  increased  to  nearly  99%  which  is  demonstrated  in 
Figure  7b  which  shows  a  nearly  perfect  precision  recall  plot. 

4.  CONCLUSIONS  AND  OBSERVATIONS 

The  statistical  analysis  and  classification  of  acoustic  color  data  is  a  challenging  task.  We  have  proposed  a  novel 
approach  for  the  classification  of  acoustic  color  using  elastic  functional  data  analysis  with  a  new  approach  from 
the  emerging  field  of  shape  analysis  curves.  We  have  demonstrated  a  method  which  aligns  the  data  and  a  way  to 
easily  calculate  a  proper  Riemannian  distance  between  functions,  which  may  be  used  for  classification  purposes. 
Aligning  the  data  allows  one  to  overcome  measurement  error  between  target  passes  and  preliminary  results  on 
the  provided  data  set  show  good  separation  among  targets  in  feature  space. 
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